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Replication protein A (RPA), the heterotrimeric single-stranded-DNA (ssDNA) binding protein (SSB) of eu- 
karyotes, contains two homologous ssDNA binding domains (A and B) in its largest subunit, RPA1, and a third 
domain in its second-largest subunit, RPA2. Here we report that Saccharomyces cerevisiae RPA1 contains a 
previously undetected ssDNA binding domain (domain C) lying in tandem with domains A and B. The carboxy- 
terminal portion of domain C shows sequence similarity to domains A and B and to the region of RPA2 that 
binds ssDNA (domain D). The aromatic residues in domains A and B that are known to stack with the ssDNA 
bases are conserved in domain C, and as in domain A, one of these is required for viability in yeast. Interest- 
ingly, the amino-terminal portion of domain C contains a putative Cys 4 -type zinc-binding motif similar to that 
of another prokaryotic SSB, T4 gp32. We demonstrate that the ssDNA binding activity of domain C is uniquely 
sensitive to cysteine modification but that, as with gp32, ssDNA binding is not strictly dependent on zinc. The 
RPA heterotrimer is thus composed of at least four ssDNA binding domains and exhibits features of both bac- 
terial and phage SSBs. 



Single-stranded-DNA (ssDNA) binding proteins (SSBs) par- 
ticipate in almost every aspect of DNA metabolism. These 
proteins bind tightly to ssDNA with little or no sequence spec- 
ificity and activate ssDNA by reducing its secondary structure 
and by stimulating DNA polymerases and DNA helicases. At 
first glance, the three well-studied SSBs encoded by phage T4 
(gp32), Escherichia coli (Ecssb), and eukaryotes (replication 
protein A [RPA]) appear to have evolved very different ways 
of accomplishing this task. Gene 32 protein is a monomer of 
33 kDa that binds ssDNA cooperatively (co = 3,800), with a bind- 
ing-site size of 8 nucleotides (nt) (19, 46). The structure of 
gp32 has been determined by X-ray crystallography and shown 
to contain a hydrophobic pocket composed of five p strands 
that contact the ssDNA bases and an electropositive cleft that 
contacts the phosphate backbone (52). While gp32 is a zinc 
metalloprotein, zinc is not required for ssDNA binding but is 
required for cooperativity and stabilization of the structure of 
the core fragment (19, 20, 43). The prototype bacterial SSB, 
Ecssb, is a tetramer of four identical 19-kDa subunits, or pro- 
tomers, with at least two well-characterized DNA binding 
modes (38). At low salt concentrations (<10 mM NaCl) two 
protomers bind 35 nt of ssDNA with unlimited cooperativity 
(id = 10 5 ) (18, 50), and at high salt concentrations (£:0.2 M 
NaCl) four protomers bind 65 nt of ssDNA with limited coop- 
erativity (co - 420) (9, 38, 39). This larger binding mode results 
in compaction of the ssDNA due to higher-order binding or 
wrapping about the tetramer (10, 12, 23). 

The nuclear SSB of eukaryotes, RPA, is a heterotrimeric 
complex that was originally identified as a Homo sapiens pro- 
tein (hsRPA) required for simian virus 40 (SV40) DNA rep- 
lication in vitro (17, 59-61). RPA has been identified in nu- 
merous species, including the yeast Saccharomyces cerevisiae 
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(scRPA), where it is a complex of subunits of 69 (RPA 1), 36 
(RPA2), and 13 (RPA3) kDa (8). Each subunit of RPA is 
required for SV40 DNA replication (16, 30) and for viability in 
yeast (7, 26). 

RPA1 is known to bind ssDNA on its own and for some time 
was thought to be the only subunit that binds ssDNA (8, 16, 30, 
62). Structure-function analysis revealed that the N-terminal 
18 kDa of RPA1 is dispensable for SV40 DNA replication (21) 
but that the C terminus, which contains a putative Cys 4 -type 
zinc-binding domain, is required for RPA2 binding (21, 35, 36). 
The central portion of hsRPAl (residues 168 to 442) contains 
a major ssDNA binding domain (21, 22, 35, 44). The notion 
that RPA might contain additional ssDNA binding domains 
was suggested by the fact that this domain has a 20-fold lower 
affinity for ssDNA than does the heterotrimer (21) and by the 
fact that different binding-site sizes have been reported for 
RPA. These include an 8-nt binding mode that is dependent on 
glutaraldehyde cross-linking (3), a 30-nt binding mode that is 
obtained by fluorescence quenching and electrophoretic mo- 
bility shift assay (EMSA) (2, 32-34, 41, 55), and a 90-nt binding 
mode that is obtained by fluorescence quenching and electron 
microscopy (EM) (1). These various sizes might be explained if 
RPA has multiple ssDNA binding domains and alternative 
binding modes like Ecssb (45). 

Genetic and biochemical analysis in yeast revealed that the 
major ssDNA binding domain of yeast RPA1 is composed of 
two homologous subdomains, A and B, with weak sequence 
similarity to Ecssb (45). Both domains are required for viabil- 
ity, and domain A can functionally replace domain B. Based on 
a model of ssDNA binding by Ecssb (11), a pair of conserved 
aromatic residues in each domain was identified by amino acid 
sequence alignment and proposed to stack with the ssDNA. One 
of these, F238, is likely to be important for ssDNA binding 
since it was the only residue in scRPAl that was individually 
required for viability. Portions of this model have been con- 
firmed by the crystal structure of residues 181 to 422 of hsRPAl 
(RPA1 181 ^ 22 ) bound to ssDNA (5). Bochkarev and colleagues 
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(5) showed that the central ssDNA binding domain of hsRPAl 
is composed of two structurally homologous subdomains, or 
"OB folds" (42) (for oligonucleotide oligosaccharide binding 
folds), and that binding by each domain involves a number of 
hydrogen bonds as well as the stacking of ssDNA bases with a 
pair of aromatic residues, one of which is F238. To closely 
resemble Ecssb, however, two additional ssDNA binding do- 
mains must exist in the RPA heterotrimer. One of these is in 
the middle subunit, since it was demonstrated that RPA2 can 
bind ssDNA as part of the heterotrimer (45) or as part of the 
RPA2-RPA3 subcomplex of hsRPA (6). Binding by the RPA2- 
RPA3 subcomplex of hsRPA is stimulated by the C terminus of 
RPA1, suggesting that this domain might contain another 
ssDNA binding activity (6). RPA3 was proposed to serve as a 
binding domain; however, no direct evidence has yet been 
obtained to support this idea (45). 

Here we show that the fourth ssDNA binding domain of 
RPA lies in the C terminus of RPA1, adjacent to domains A 
and B. This result confirms that both cellular SSBs, Ecssb and 
RPA, have at least four ssDNA binding domains. Curiously, 
the new domain in the C terminus of RPA1 has several fea- 
tures in common with the phage SSB gp32. This suggests that 
these very different SSBs are likely to have a common mech- 
anism of function, if not a common origin. 



MATERIALS AND METHODS 

DNA constructions. Fragments of the RFA 1 gene encoding scRPAi were 
isolated on Nde\/BamH\ cassettes by 12 rounds of amplification with Vent DNA 
polymerase and pJM136 (45) as the template. Fragments were ligated into 
pETUa to produce the following E. coli expression plasmids: pJM163 (Ml to 
K180), pJM158 (M, T181 to N294), pJM159 (M, V295 to D415), pJM165 (M, 
S416 to A4S5), pJM164 (M, S416 to N511), pJM157 (M, F481 to S601), pJM155 
(M, F481 to A621), and pJM185 (M, S416 to A621). Point mutations were 
introduced by two rounds of oligonucleotide-directed PCR mutagenesis, and the 
mutated sequences were ligated into pETl la for E. coli expression or into a yeast 
shuttle vector to test genetic complementation. The following domain C muta- 
tion expression plasmids were identical to pJM185 except for the indicated 
mutations: pJM174 (M, S416 to S601, with a C-terminal truncation of 20 amino 
acids in domain C) and pJM186 (C505A and C508A). To test whether an RFA1 
gene with point mutations could complement an RFA] deletion, we placed the 
mutations in the context of an intact RFA1 gene by using the "domain swap" 
vector pDSl (45) to produce pDS1.13 (F238A), pDS1.28 (F269A), pDSU4 
(W360A), and pDS1.29 (F385A). Plasmid pDSlC was constructed for placing 
domain C mutations into RFA1 by introducing a Sail site just downstream of the 
Cys 4 motif (resulting in the amino acid changes T510V and N51 ID) and a BgtU 
site near the end of domain C (resulting in the amino acid change Y594S). 
Plasmid pDSlC was shown to confer wild-type activity by a complementation test 
and was used to place mutations in RFA J, which resulted in the following plas- 
mids: pJM183 (C505A and C508A), pDSlC9 (F537A), and pDSlCIO (F563A). 
Complementation with the RPA2 F143A mutation was tested with pJM243 (45), 
and complementation with the RPA2 W101A mutation was tested with pJM254 
(W101 A), which was constructed in pDS2 (45). Maltose binding protein (MBP) 
fusions were constructed in the vector pMAL-c2 (New England Biolabs) by 
inserting BamHl fragments that were generated by PCR as described above into 
the unique BamHl site of the vector. Plasmid pJM410 expresses the MBP-A 
domain fusion (T181 to N294), pJM411 expresses the MBP-B fusion (V295- 
D4I5), and pJM412 expresses the MBP-C fusion (G418-A621). Genetic comple- 
mentation of the rf'aI-l::TRPl and rfa2-l::TRPl null mutations was performed as 
described previously (45). 

Protein expression, purification, and ssDNA binding assay. Recombinant 
RPA1 proteins were expressed in strain BL21 (DE3) by using the T7 expression 
system (54). Cells were grown in Luria-Bertani medium at 37°C in the presence 
of ampicillin and induced for 16 h by the addition of 0.4 mM 1PTG (isopropyl- 
3-D-thiogalactopyranoside). Cells were harvested and lysed as described previ- 
ously (45), and the insoluble pellet from 6 ml of culture was resuspended in 0.3 
ml of 10 M urea. The sample was then diluted with 0.3 ml of 2x buffer A (25 mM 
Tris [pH 7.5], 1 mM EDTA, 0.01% Nonidet P-40, 10% glycerol, 0.1 mM phe- 
nylmethylsulfonyl fluoride, 1 mM dithiothreitol [DTT]) containing 200 mM 
NaCl, and the sample was centrifuged 15 min at 4°C. The soluble portion was 
then sequentially dialyzed to 2, 1, 0.5, and 0 M urea in dialysis buffer consisting 
of buffer A and 100 mM NaCl, 10 mM MgCl 2 , and 20 \lM ZnS0 4 . Each dialysis 
step was carried out for 8 to 16 h. Precipitates were removed by centrifugation, 
and the soluble sample, typically 0.3 mg of recombinant protein per ml, was 
stored at -80°C Protein concentrations were determined by comparison to 



known standards on Coomassie blue-stained sodium dodecyl sulfate (SDS)- 
polyacrylamide gels. 

MBP fusion proteins were expressed in strain XL 1 -Blue grown in Luria- 
Bertani medium with 0.2% glucose and 0.1 mg of ampicillin per ml. A 1-liter 
culture was induced for 2 h with 0.3 mM IPTG, and the cells were collected by 
centrifugation and resuspended in buffer A containing 50 mM NaCl and 1 mg of 
lysozyme per ml. All buffers for MBP-C also contained 20 p.M ZnS0 4 through- 
out the purification. All steps were performed either on ice or at 4°C. Following 
a 15-min incubation on ice, the samples were subjected to one freeze-thaw cycle 
and four sonication cycles (15 s each). The lysate was clarified by centrifugation, 
diluted fivefold with buffer A containing 50 mM NaCl, and applied to a 15-ml 
amylose-atfinity resin. The resin was washed with 8 column volumes of buffer A 
containing 300 mM NaCl, and the MBP fusion proteins were eluted in a step with 
buffer A containing 50 mM NaCl and 10 mM maltose. Peak fractions were 
identified by the Bradford assay, pooled, and loaded onto a Mono Q column. 
This column was washed with 3 ml of buffer A containing 50 mM NaCl and 
eluted with an 8-ml gradient from 50 mM to 1 M NaCl; The peak fractions (0.15 
M NaCl) were identified by SDS-polyacrylamide gel electrophoresis (PAGE), 
pooled, and dialyzed to buffer A containing 50 mM NaCl. 

Typically, 10 p.1 of ssDNA binding reaction mixtures containing protein ex- 
tracts from £. coli were incubated with a 10,000 cpm (2 fmol) of * 2 P-labeled 
17-base oligonucleotide (universal sequencing primer) in the following solution: 
10 mM HEPES (pH 7.5)-0.5 mM DTT-10 u-g of sheared salmon sperm DNA 
per ml. Reaction mixtures were cross-linked and analyzed as described previ- 
ously (45). Binding reactions for MBP fusion proteins were carried out under 
identical conditions except that they were UV cross-linked at 500 J/m 2 to min- 
imize the time of UV exposure. Gels were analyzed with a phosphorimager, and 
the intensities of the bound and free probes were quantitated with IP-Lab Get 
software. The K d for each binding reaction was then determined by fitting the 
data to the Langmuir equation. 

Amino acid sequence analysis. The amino acid sequences of the four domains 
shown in Fig. 3 were initially aligned with the PILEUP program. This alignment 
identified the absolutely conserved aspartic acid as well as the invariant N-ter- 
minal aromatic residue in all four domains. An aromatic residue corresponding 
to the C-terminal stacking residue was found to be conserved within each of the 
four domains but was not aligned between domains. The residues at this position 
in domains A and B were aligned with each other based on the structural 
alignment of hsRPAl 181-422 (5), which consisted of shifting the stacking residue 
of domain B three residues upstream. The aromatic residue in domain D had 
aligned with this residue in domain B and was similarly shifted three residues. 
The aromatic residue of domain C was shifted three residues downstream. The 
PHD structure prediction program has been described previously (47-49). 

PMPS inactivation and reactivation. p-Hydroxymercuriphenylsulfonate (PMPS) 
treatment was performed essentially as described previously (15, 20). All buffers 
and water were first passed over 30 ml of Chelex-100 resin (Bio-Rad), and the pH 
was adjusted with NaOH. Extracts were prepared without zinc and dialyzed twice 
against 1,000 volumes of buffer A lacking DTT and containing 10 mM EDTA. 
Concentrations of samples were made to be 2.5 mM in PMPS (Sigma) for 1 h on 
ice and adjusted to 50 mM in EDTA. Sensitivity to PMPS was determined by 
assaying the sample as usual, except that the assay buffer did not contain DTT. 
Reactivation was performed by dialyzing the PMPS- treated sample against TNG 
buffer (25 mM Tris [pH 7.5], 0.2 M NaCl, 5% glycerol) containing 1 mM EDTA 
to remove unreacted PMPS. The samples were then incubated with 0.1 M 
p-mercaptoethanol and increasing concentrations of zinc sulfate for 30 min on 
ice. Samples were assayed in the absence of DTT. 

RESULTS 

Identification of a new ssDNA binding domain in RPA1. A 

UV-cross-linking EMSA was previously used to characterize 
the ssDNA binding activity of domains A and B of scRPAi 
(45). In this method, an oligonucleotide 32 P labeled at its 5' 
end is incubated with an extract made from E. coli expressing 
a defined fragment of RPA protein and the mixture is irradi- 
ated with UV light to covalently link bound protein to the 
oligonucleotide. Activity is then detected by gel shift assay on 
a denaturing polyacrylamide gel. The experiments described 
here use the 17-nt universal sequencing primer to minimize the 
number of multiple protein-ssDNA interactions, but similar 
results were obtained with oligo(dT) 40 . 

To search for additional ssDNA binding domains within the 
RPA1 subunit, fragments of the yeast gene encoding RPA1 
were cloned into an £. coli expression vector such that the 
entire RPA1 protein could be expressed as several protein 
fragments (Fig. 1). Upon induction and cell lysis, each RPA1 
fragment was found in the insoluble fraction. These proteins 
were solubilized in urea and refolded by stepwise dialysis. 
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FIG. 1. Identification of a new ssDNA binding domain in RPA1. Fragments 
of the scRPAl protein were expressed in £ coli and tested for ssDNA binding 
activity by the UV EMSA. At the top is a schematic of the scRPAl protein 
illustrating the locations of the three ssDNA binding domains and the eight 
protein fragments (numbered horizontal black bars) that were tested in this 
assay. Numbers above the schematic refer to the amino acid numbers of the 
scRPAl protein. For each protein fragment, a portion of extract containing 
approximately 0.3 p.g of recombinant protein was incubated with a 17-nt oligo- 
nucleotide 32 P labeled at its 5' end (P) and cross-linked with UV light. Products 
were resolved on a 6% denaturing polyacryl amide gel. V, extract made from 
£ coli expressing the vector alone; A*, B*, and C*, RPA1 domains A, B, and C; 
bkg, background band. 

SDS-PAGE revealed that all RPA1 fragments were soluble 
and present in nearly equal amounts except for the smallest 
one (fragment 4), which was about fourfold less concentrated 
(data not shown). Portions of each extract were then subjected 
to the UV EMSA, and as was shown previously (45), fragments 
encompassing domains A and B resulted in a clear mobility 
shift (Fig. 1). At least two background bands were observed 
with all extracts, including extracts of £. coli expressing the 
vector alone (lane V). A strong mobility shift was also seen in 
assays of fragment 8, which encompasses residues 416 to 621. 
Assays of smaller fragments within this region (fragments 4 to 
7) did not result in binding; however, we cannot rule out the 
possibility of a signal comigrating with the background bands. 
It should be noted that the RPA binding activities in this assay 
resulted in a corresponding decrease in the signal of the major 
background band (Fig. 1). This effect was due to limiting 
amounts of probe used in this experiment and was not ob- 
served when the ratio of probe to extract was increased (data 
not shown, but see Fig. 2). We also note that the signal ob- 
tained in the UV EMSA depends on both a protein's ssDNA 
binding affinity and its efficiency of cross-linking, which may 
differ between proteins. The new binding domain located in 
the 206 amino acids C terminal to domain B is hereafter 
referred to as domain C The ssDNA binding domain in RPA2 
(6, 45) is here renamed domain D. 

To determine the specificity of domain C binding, UV EMSAs 
were performed on extracts of E. coli expressing domains A, B, 
and C in the presence of competitor nucleic acid. As shown 
previously, binding by domains A and B is competed by the 
presence of unlabeled ssDNA but not by the presence of un- 
labeled double-stranded DNA (dsDNA) (Fig. 2; e.g., compare 



lanes 2 and 3 to lanes 4 and 5). The ssDNA binding activity of 
these two domains is also resistant to added RNA as the 
competitor (e.g., compare lanes 1 and 6). Domain C shows an 
identical pattern: it is specifically competed by ssDNA but not 
by dsDNA or RNA (lanes 13 to 18). 

Amino acid sequence analysis of domain C. To identify 
residues of domain C that might contribute to its ssDNA bind- 
ing activity, the amino acid sequences of domains A, B, C, and 
D (RPA2) from a number of species were compared (Fig. 3). 
The alignment of domains A and B was anchored by using the 
structural comparison of hsRPAl 181 _4 2 2 described by Boch- 
karev et al. (5), and presented at the top of the alignment is a 
schematic of the corresponding secondary structure of this 
region. As described previously, this structure is an OB fold 
which consists of five strands of p-sheet with an a-helix con- 
necting the third and fourth strands (42). A pair of aromatic 
residues in hsRPAl domains A and B (one at the end of strand 
3 and one in the loop between strands 4 and 5) make stack- 
ing interactions with the DNA bases. Several nonconserved 
residues in 0-strands 1 and 2 (a structure termed the p-hair- 
pin) make hydrogen bonds with the phosphate backbone 
and the DNA bases. A very similar structure in gp32 indi- 
cated that the ssDNA binding pocket of gp32 is also an OB 
fold (5, 52). 

Highlighted in Fig. 3 are three residues that are conserved in 
all four ssDNA binding domains. One residue is an absolutely 
conserved aspartic acid that lies at the end of the p2 strand. 
This residue was previously noted for being conserved in do- 
mains A and B (45), and mutation of this residue produces 
phenotypes in yeast when it is mutated in domains A (53), B 
(45), and D (RPA2) (40, 51). Approximately 10 residues down- 
stream of the conserved aspartic acid is an invariant aromatic 
residue that lies at the end of the p3 strand and is homologous 
to the N-terminal stacking residue, F238, of hsRPAlA. Ap- 
proximately 30 residues further downstream is another aro- 
matic residue which is conserved in all domains except RPA4 
(31). This aromatic residue lies in the loop between strands 34 
and P5 and is homologous to the C-terminal stacking residue 
F269 of hsRPAlA. 

While domains A and B of all species probably consist of OB 
folds like their human homologs, the alignment in Fig. 3 im- 
plies that domains C and D (RPA2) are also OB folds. With 

Domain: A B C V 

Competitor: - - ^1^3 i - ^ J^] i - ss 




FIG. 2. RPA1 domain C binds ssDNA specifically. Extracts of £ coli con- 
taining approximately 0.3 jig of recombinant protein were assayed in the pres- 
ence of the indicated competitor, namely, 0.5 or 3 u.g of unlabled lambda DNA 
before (double stranded [DS]) or after (single stranded [SS]) boiling or 3 ^.g of 
yeast t RNA (RNA). V, extract of £ coli expressing the vector alone or competed 
with 3 M-g of ssDNA. A*, B*, and C*, RPA1 domains A, B, and C. 
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FIG 3 Amino acid sequence alignment of the four ssDNA binding domains of RPA. The amino acid sequences of the indicated proteins were aligned with the 
PILEUP program and optimized as described in Materials and Methods. The secondary structure of hsRPAl 181 ^ 22 , presented at the top of the sequences, was ; taken 
from Bochkarev et al (5) Highly conserved residues are indicated in white type on a black background, and the putative zinc-binding domain in domain C is shaded. 
Numbers to the right of the protein designations refer to amino acid numbers. Note that 14 amino acids are deleted from the scRPA2 sequence at -residue 115. Hs, 
H sapiens; XI, Xcnopus laevis\ Sc, 5. cercvisiae; Sp, Schizosaccharomyces pombe; Os, Oryia sativa\ Cf, Crithidia fasiculata\ Ce, Cacnorhabditis elegatis. 



domain C, however, it cannot be so simple, since the pi strand 
is replaced by the C-terminal portion of the Cys 4 motif (Fig. 3). 
But the position of the Cys 4 motif relative to the positions of 
the remaining secondary structures is similar to that of the 
zinc-binding protein gp32. Specifically, the p-hairpin of gp32 
was shown to be interrupted by the zinc-binding domain (5, 
52). This may be the case in domain C as well. 

A secondary-structure prediction program was used to de- 
termine whether the sequences of domains C and D were 
consistent with that of an OB fold. This program accurately 
predicted the OB fold domains of gp32 as determined by its 
crystal structure (52). Further, the secondary structures of all 
species of RPA1 domain C were found to be similar to each 
other, and the results for the yeast and human proteins are 
presented in Fig. 4A. In both species, a p-sheet is predicted 
immediately upstream of the Cys 4 motif and may be the pi 
strand of an OB fold. Directly following the Cys 4 motif is 
another p strand (P2) followed by a third shorter p strand (P3) 
and an a-helix. A p-sheet is predicted near residues 580 and is 
proposed to be p5. A discrepancy between this analysis and the 
OB fold of gp32 is that an a-helix is predicted to exist where p4 
is expected. Thus, domain C may not be a typical OB fold. 
With domain D, the secondary-structure prediction contained 



all five p-sheets and the a-helix (Fig. 4B) (6). It is worth noting 
that all the components of the hypothetical OB fold in domain 
D lie within the minimal region of scRPA2 that is required for 
viability in yeast (residues 40 to 173) (45). What is remarkable 
about both secondary-structure predictions is that each of the 
conserved residues identified in Fig. 3 lies at the expected 
position relative to the secondary structures of the OB folds in 
hsRPAl: the invariant aspartic acid closely follows P2, the first 
aromatic closely follows a shorter p3, and the second aromatic 
lies in the loop between p4 and p5. Although this loop is 
predicted to be helical in domain C, so also were helices pre- 
dicted in the p4-p5 loop of yeast and human domains A (data 
not shown). 

Mutational analysis of domain C. The sequence analysis 
described above makes predictions regarding the roles of the 
Cys 4 motif and. potential stacking residues in domain C func- 
tion. To test these predictions, mutations were generated in 
domain C and the mutant proteins were expressed in E. coli 
and assayed for ssDNA binding activity. Wild-type domain C 
and domain C with a C-terminal truncation of 20 amino acids 
are active in ssDNA binding, indicating that the extreme C 
terminus is not required for binding (Fig. 5). But, changing 
cysteines 505 . and 508 to alanine resulted in a recombinant 
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FIG. 4. Secondary-structure predictions of domains C and D are consistent with an OB fold. Amino acid sequences of domains C and D from yeast and humans 
were analyzed by the Predict Protein program, European Molecular Biology Laboratory, Heidelberg, Germany (48). The predicted secondary structures of domains 
C (A) and D (B) are displayed above the yeast (scRPAlC and scRPA2, respectively) and below the human (hsRPAlC and hsRPA2, respectively) sequences. The 
sequence of the Cys 4 motif (A) and the three highly conserved residues within each amino acid sequence (A and B) are highlighted in white type on a black background. 
The proposed £ strands of the OB fold are indicated. H, helix; E, extended $-sheet. 



protein that could no longer bind ssDNA. Similar mutations in . 
the Cys 4 motif of hsRPA are known to reduce or eliminate 
RPA activity in SV40 DNA replication (35, 36) and mismatch 
repair (37). Consistent with the role of RPA in these essential 
processes, complementation analysis revealed that the C505A- 
C508A double mutation is lethal in yeast (Table 1). 

To further analyze the role of the Cys 4 motif in ssDNA 
binding, domain C activity was tested for sensitivity to the re- 
versible cysteine-modifying reagent PMPS (15, 20, 27). Unlike 
domains A and B, treatment of domain C with PMPS abol- 
ished ssDNA binding activity (Fig. 6A). Ecssb was resistant to 
PMPS treatment, as was the background binding present in the 
extract. This result is consistent with genetic and biochemical 
evidence supporting a role for this domain in RPA function 
(35-37). 

ToTest whether a divalent metal was required for binding by 
domain C, the PMPS-modified protein was treated with high 
concentrations of EDTA and extensively dialyzed to remove all 
metals. The protein modification was then reversed with the 
reducing agent p-mercaptoethanol. As before, the PMPS-mod- 
ified protein was inactive and could not be stimulated by the 
presence of 1 mM zinc sulfate (Fig. 6B). However, reversal 
of the modification with p-mercaptoethanol restored ssDNA 
binding activity even in the absence of added zinc. Titration of 
zinc sulfate into the reaction mixture had only a slight stimu- 



lating effect on binding. These results indicate that while re- 
combinant domain C protein is sensitive to cysteine modifica- 
tion, zinc is not strictly required for ssDNA binding. 

The role of the proposed stacking residues in RPA function 
in vivo was tested by mutation and complementation analysis 
in yeast. The N-terminal stacking residue of each domain, 
analogous to F238 in the f33-a-helix interval of hsRPAlA, was 
mutated to alanine. This mutation was found to be lethal in 
domains A and C but not in domains B and D (Table 1). The 
proposed C-terminal stacking residues were similarly tested. 
None of these C-terminal mutations had any noticeable effect 
on viability or growth rate when they were tested for comple- 
mentation in yeast (Table 1). Thus, with respect to viability, the 
phenotypes of corresponding mutations in domains A and C 
are identical. Remarkably, of over 60 point mutations that we 
have made in scRPAl, only F238 and F537 were individually 
essential for viability. The fact that these residues are con- 
served by amino acid sequence alignment suggests that they 
perform the same function in ssDNA binding. 

Quantitation of ssDNA binding affinity. In order to measure 
the affinity of ssDNA binding by domains A, B, and C, we 
attempted to purify them from the E. coli extract. Unfortu- 
nately, fractionation of the refolded proteins caused them to 
precipitate quantitatively. To facilitate the recovery of soluble 
protein, we fused each of the RPA domains to MBP and pu- 
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FIG. 5. ssDNA binding activity of domain C requires the Cys 4 motif. Extracts 
of E. coli expressing domain C of scRPAl with the indicated mutations were 
assayed by U V EMS A. Vector is an extract of E. coli expressing the vector alone. 
AC20 is domain C with a C-terminal truncation of 20 amino acids. WT, wild type. 



rifled them by affinity and ion-exchange chromatography. Fig- 
ure 7A shows an SDS-PAGE analysis of the purified proteins 
which were estimated to be 95% pure. The MBP-C fusion ap- 
pears as a ladder of bands. This pattern was obtained on mul- 
tiple purifications and was insensitive to the amount of zinc 
sulfate present during purification. We assume that these bands 
arise from limited proteolytic degradation. 

Increasing amounts of each MBP fusion protein were incu- 
bated with a fixed amount of probe and UV cross-linked, and 
the products were resolved by denaturing gel electrophoresis. 
As expected, the MBP-A, MBP-B, and MBP-C fusion proteins 
possessed a ssDNA binding activity that was not present in 
purified MBP alone (Fig. 7B). Note that at high levels of MBP- 



A a slower-moving band which may represent multiple proteins 
binding to a single oligonucleotide is observed. To confirm that 
the binding of these proteins to ssDNA was specific, we per- 
formed two additional experiments. First, a fixed amount of 
each MBP fusion protein was incubated with increasing amounts 
of probe to 'test whether the binding could be saturated with 
ssDNA substrate. As shown in Fig. 8A; increasing amounts of 
probe resulted in increasing amounts of DNA binding until the 
signal reached a plateau at 50 pmol of input oligonucleotide (3- 
to 10-fold molar excess). Second, we tested that the binding of 
MBP fusion proteins to ssDNA was in equilibrium by reversing 
the probe-binding with unlabeled ssDNA*. Following a stan- 
dard incubation of MBP fusion protein with probe, unlabeled 
oligonucleotide was added and a second incubation was per- 
formed. The products were then subjected to UV cross-linking 
and gel electrophoresis. The presence of increasing amounts 
of competitor oligonucleotide resulted in decreased levels of 
probe binding by each MBP fusion protein, indicating that 
the binding of all MBP fusion proteins was specific (Fig. 8B). 
Again, no binding was seen by MBP alone. The amounts of 
free and bound probes present in protein titrations of MBP 
fusion proteins were quantitated by phosphorimaging, and dis- 
sociation constants were calculated. Based on these calcula- 
tions we find that the binding affinities of MBP-A and MBP-B 
are approximately equal and two to three times greater than 
that of MBP-C. 



RPA, like its prokaryotic homologs, is involved in multiple 
aspects of DNA metabolism (58, 60), including roles in DNA 
replication (17, 59, 61), DNA repair (13^ 24), and genetic re- 
combination (25, 53). While the role of the RPA complex in 
these processes is well documented, the functions of RPA's 
individual subunits are not well understood. It has been noted 
that RPA resembles Ecssb in that both cellular SSBs are mul- 
timeric complexes; RPA is a heterotrimer and Ecssb is a ho- 
motetramer. The identification of ssDNA binding domains A 
and B in RPA1 and a third domain in RPA2 (5, 6, 45) sug- 
gested that RPA might contain a fourth ssDNA binding do- 
main analogous to that in Ecssb. The identification of a new 
ssDNA binding domain in the C terminus of RPA1 supports 
the idea that cellular SSBs require multiple ssDNA binding do- 
mains to carry out their essential function. 

The putative zinc-binding domain in the C terminus of RPA1 
has long been recognized and compared to that in gp32 (16, 
26), but its role in RPA function was not obvious. C-terminal 
truncations of RPA1 are known to eliminate SV40 DNA rep- 
lication and the RPA1-RPA2 interaction but to only slightly 
reduce the ssDNA binding activity of the 70-kDa subunit (21, 
22, 35, 36). The essential role of the RPAl C terminus in SV40 
DNA replication may therefore be explained by its effect on 
trimer formation rather than on ssDNA binding activity. How- 
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TABLE 1 . Mutational analysis of conserved residues in RPA 



_ . . _ Mutation at interval w-.u;i;^ Mutation at interval viahilitv Mutations at Viabilitv 

Subunit Domain (33-a-helix Viability- p4 _ p5 Viability jmerva , pi viaouity 

RPA1 A F238A - F269A + NA b NA 

RPA1 B W360A + F385A -f NA NA 

RPAl C F537A - F563A + C505A and C508A 

RPA2 D W101A + F143A + NA NA 

" Ability of strain SB Y 102 (domains A, B, and C) or SBY105 (domain D) bearing a single-copy plasmid with the indicated mutation in RPAl or RPA2 to form 
colonies on 5-fluoro-orotic acid as described previously (45). 
b NA, not assayed. 
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FIG. 6. Domain C binding activity is sensitive to cysteine modification. (A) 
Extracts of £. colt expressing the indicated domains of scRPAl or purified Ecssb 
(ssb) were incubated in the presence or absence of 2.5 mM PMPS for 1 h, made 
50 mM in EDTA, and assayed for ssDNA binding activity by UV EMSA. (B) An 
extract of E. coli expressing domain C was treated with 2.5 mM PMPS for 1 h, 
made 50 mM in EDTA, and dialyzed against Chelex-100-treated TNG buffer 
containing 1 mM EDTA. The modified protein was then treated with the indi- 
cated reagents for 30 min before assay by U V EMSA. (3ME, 0-mercaptoethanol; 
bkg, background. 



ever, the fact that point mutations in the Cys 4 motif do not 
disrupt the RPA1-RPA2 interaction but still eliminate SV40 
DNA replication in vitro (35, 36) suggests that it is the ssDNA 
binding activity of this domain that is important for function. 
Indeed, the role of the Cys 4 motif in the ssDNA binding ac- 
tivity of domain C was confirmed both with point mutations 
and by chemical modification (Fig. 5 and 6). We conclude that 
the ssDNA binding activity of this domain is essential for RPA 
function and that the C-terminal region of RPA1 is required 
for both ssDNA binding and complex formation. The dual 
role of the RPA1 C terminus might explain why. the second- 
ary structure of this domain was predicted to have an a-helix 
where the (S4 strand of the OB fold was expected. Experiments 
to test whether this helix contributes to the additional activity 
of binding RPA2 are in progress. 

Having demonstrated that the Cys 4 motif is essential for 
domain C activity, we were surprised to find that zinc is not re- 
quired for ssDNA binding. While the possibility of trace metal 
contamination cannot be ruled out in this experiment, precau- 
tions were taken to avoid such impurities. For example, all buff- 
ers were treated with Chelex-100 resin, which removes divalent 
metals (15, 27), and different preparations of water and p-mer- 
captoethanol gave identical results. The Cys 4 motif of domain 
C may therefore function like the zinc-binding domain of gp32, 
which does not require zinc for ssDNA binding but for coop- 
erativity of binding (19, 43). It has been shown that apo-gp32, 
which lacks zinc, binds a single-site substrate with the same af- 
finity as metallo-gp32 (19). In contrast, the binding affinity of 



metallo-gp32 was higher than that of apo-gp32 on larger sub- 
strates due to an increase in cooperativity. The binding-site 
size of the 206-amino-acid domain C protein has not.been de- 
termined, but it is possible that the small stimulation of binding 
by zinc seen in Fig, 6B is due to enhanced cooperativity of 
binding to the 17-nt substrate used in this study. Confirmation 
of a role for zinc in this process awaits further experiments with 
larger substrates. 

The data presented here lead to a revised model of RPA 
(45). This model proposes that the RPA heterotrimer consists 
of at least four ssDNA binding domains that are each essential 
for RPA activity and likely to serve distinct roles in RPA 
function. Domains A, B, and C lie within RPA1, and domain D 
lies within RPA2. The three domains of RPA1 constitute a 
very strong binding site and may bind ssDNA simultaneously at 
low concentrations of salt. At 250 mM NaCl, domain D is 
capable of binding ssDNA as well (45). Since domain D lies 
within another subunit, it is possible that higher-order binding 
or wrapping results from the interaction of ssDNA with do- 
main D, leading to compaction of the ssDNA. While EM has 
provided some evidence for ssDNA wrapping by yeast RPA (1) 
and salt-dependent compaction of ssDNA by human RPA 
(56), investigators in the latter study suspected that factors 
other than wrapping were the cause of compaction. 

This model may explain the discrepancy in ssDNA binding- 
site sizes that have been reported for RPA. Given that RPA is 
a complex of at least four different binding domains, it is 
reasonable to expect multiple binding-site sizes, perhaps sim- 
ilar to that of Ecssb. EM studies of hsRPA bound to oligonu- 
cleotides suggested that the 8-nt binding mode, which is iden- 
tified by glutaraldehyde cross-linking (3), is an initial event that 
subsequently resolves into a stable 30-nt mode (4). Based on 
the known interaction of domains A and B with 8 nt of ssDNA 
(5), it is likely that this weak initial mode arises by cross-linking 
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FIG. 7. ssDNA binding activity of MBP-RPA fusion proteins. (A) Two mi- 
crograms of each purified MBP fusion protein was resolved by SDS-10% PAGE 
and stained with Coommassie blue. (B) Increasing amounts of the indicated 
MBP fusion protein or native MBP were incubated with a fixed amount of probe, 
UV cross-linked, and resolved by denaturing gel electrophoresis. Protein titra- 
tions are 0.1, 0.3, 1, 3, and 10 u.g in 15-u.l binding reaction mixtures. 
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of ssDNA to these two domains (28). The current model then 
suggests that interaction of ssDNA with domains A, B, and C 
leads to the stable 30-nt mode, which was observed to have a 
distinctly elongated appearance by EM (4, 28). As described 
above, interactions between ssDNA and all four domains may 
account for the 90-nt mode in the presence of high concentra- 
tions of salt (1). Alternatively, it is possible that the 8- and 
30-nt binding modes represent interactions with domain A and 
the A-B pair, respectively. The 90-nt mode might then arise 
from the binding of the C-D pair. Further experiments will be 
needed to distinguish between the multiple possibilities arising 
from these redundant domains. These studies will also need to 
address the role of metals in binding-site size and cooperativity 
of ssDNA binding by RPA. 

If RPA is truly a structural and functional homolog of Ecssb, 
then the analysis of RPA provides a unique opportunity to 
study the role of higher-order binding in DNA metabolism. 
Since each of the four protomers of Ecssb are equivalent, it is 
not possible to ask if each of them is essential to SSB function, 
or whether they perform specific functions in the homotet- 
ramer. With RPA, each of the four protomers is distinct and 
each has now been shown to be required for yeast viability (45) 



FIG. 8. ssDNA binding activities of M BP- RPA fusion proteins aresaturable 
and reversible. (A) Increasing amounts of input DNA were incubated with 0.3 p.g 
of MBP-A (5.4 pmol), 0.3 ^g of MBP-B (5.4 pmol), or 1 u.g of MBP-C (15 pmol) 
and assayed by UV EMS A. MPB-C values correspond to the l/20th scale on the 
right, oligo, oligonucleotide. (B) One microgram of each MBP fusion protein was 
preincubated with probe (10 fmol) and then incubated with 0, 1, 3, 10, 30, 100, 
and 300 pmol of unlabeled oligonucleotide. MBP alone was incubated with 0, 100, 
and 300 pmol unlabeled oligonucleotide. The last lane is a control lacking protein. 



and for SV40 DNA replication in vitro (22, 30, 35, 36). While 
there is some evidence for functional redundancy in RPA 
(domain A can substitute for domain B), each domain may 
have a specific function since domain B cannot substitute for 
domain A (45) and preliminary experiments in this lab indicate 
that domains A, B, and D cannot function in place of domain 
C in yeast. Recently, a detailed mutagenesis of the RPA1 
subunit in yeast revealed that point mutations placed through- 
out its length can differentially affect its multiple functions in 
DNA metabolism (58). Such a result would be expected if the 
domains of RPA1 have specific functions. Other mutant stud- 
ies indicate that conditional-lethal alleles of RPA2 are defec- 
tive in replication fork movement at the nonpermissive tem- 
perature (40, 51). The ssDNA binding activity of RPA2 may 
therefore be critical to RPA function in the elongation phase 
of DNA replication. Further in vitro experiments are needed 
to reconcile the different results in studies of higher-order 
binding by RPA and to test whether domains C and D play a 
role in wrapping ssDNA. 

What advantage could a multimeric SSB have over a mono- 
meric one? A clue may be found by comparing the efficiencies 
of various SSBs in T-antigen-catalyzed unwinding (29) and 
unwinding of a pseudo-origin template (28). It is curious that 
most replicative SSBs that work in these assays (RPA, Ecssb, 
adenovirus DBP, and herpesvirus ICP8) are known to bind 
ssDNA in a multimerized fashion. RPA and Ecssb are now 
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known to be multimers, and it has been shown that DBP uses 
a C-terminal hook to multimerize (57) and drive strand dis- 
placement synthesis (14). While the mechanism of ssDNA 
binding by ICP8 has not been determined, it is possible that 
this large protein of 128 kDa also contains multiple ssDNA 
binding domains. gp32 and T7 gp2.5, which bind ssDNA with 
high affinity, being monomers, do not function in these un- 
winding assays (28, 29), and mutant RPA complexes lacking 
one or more ssDNA binding domains are capable of only 
minor levels of unwinding (21). These findings suggest that the 
multiple domains of RPA play an important role in denaturing 
double-stranded DNA. It will be of interest to test this idea and 
determine whether RPA domains C and D regulate this pro- 
cess in eukaryotic cells. 
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